1. A method of expanding an effective active
vocabulary of a speech recognition system, the method
comprising:

using a speech recognizer to perform speech
recognition on a user utterance to produce one or more
recognition candidates, the speech recognition comprising
comparing digital values representative of the user
utterance to a set of acoustic models representative of an
active vocabulary of the system, the set of acoustic models
including models of words and models of word fragments,

receiving the recognition candidates from the speech
recognizer, and

when a received recognition candidate includes a
word fragment:

determining whether the word fragment may be
combined with one or more adjacent word fragments or words
to form a proposed word included in a backup dictionary of
the speech recognition system;

if the word fragment may be combined with one
or more adjacent word fragments or words to form a proposed
word included in a backup dictionary of the speech
recognition system, modifying the recognition candidate to
substitute the proposed word for the word fragment and the
one or more adjacent word fragments or words used to form
the proposed word; and

if the word fragment may not be combined with
one or more adjacent word fragments or words to form a
proposed word included in a backup dictionary of the speech
recognition system, discarding the recognition candidate.
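
The post-processing steps recited in claim 1 can be illustrated with a minimal sketch. It is not the claimed implementation: the dictionary contents, the fragment set, the function name `resolve_candidate`, and the use of plain concatenation (no spelling rules) are all assumptions made for illustration.

```python
from typing import List, Optional

# Hypothetical data for illustration only.
BACKUP_DICTIONARY = {"walking", "preheat"}
FRAGMENTS = {"ing", "pre", "ness"}  # assumed word fragments of the active vocabulary


def resolve_candidate(tokens: List[str]) -> Optional[List[str]]:
    """Merge fragments with adjacent tokens into backup-dictionary words.

    Returns the modified candidate, or None when a fragment cannot be
    combined, which corresponds to discarding the candidate.
    """
    result: List[str] = []
    i = 0
    while i < len(tokens):
        merged = None
        # Try the longest run of adjacent tokens starting at i first.
        for j in range(len(tokens), i + 1, -1):
            span = tokens[i:j]
            joined = "".join(span)  # assumption: plain concatenation
            if any(t in FRAGMENTS for t in span) and joined in BACKUP_DICTIONARY:
                merged = (joined, j)
                break
        if merged is not None:
            result.append(merged[0])
            i = merged[1]
        elif tokens[i] in FRAGMENTS:
            return None  # fragment could not be combined: discard
        else:
            result.append(tokens[i])
            i += 1
    return result
```

For example, under these assumptions `resolve_candidate(["she", "was", "walk", "ing"])` substitutes "walking" for "walk" and "ing", while a candidate containing a stray fragment is rejected.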

6. The method of claim 5, wherein a valid sequence
includes only one or more allowed adjacent combinations of
word fragments and words.

7. The method of claim 6, wherein allowed adjacent
combinations comprise one or more prefixes, followed by a
root or a word, followed by one or more suffixes; a root or
a word followed by one or more suffixes; and one or more
prefixes followed by a root or a word.
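
The three allowed shapes in claim 7 can be checked with a short pattern match. This is only a sketch: the single-letter token classes (P for prefix, R for root, W for word, S for suffix) and the function name `is_allowed_combination` are hypothetical labels introduced here, not terms from the claims.

```python
import re
from typing import List

# Hypothetical token classes: P = prefix, R = root, W = word, S = suffix.
# The three alternatives mirror claim 7:
#   prefixes + (root or word) + suffixes
#   (root or word) + suffixes
#   prefixes + (root or word)
_ALLOWED = re.compile(r"P+[RW]S+|[RW]S+|P+[RW]")


def is_allowed_combination(kinds: List[str]) -> bool:
    """Return True when the sequence of token classes matches an allowed shape."""
    return _ALLOWED.fullmatch("".join(kinds)) is not None
```

A bare root or word with no affix does not match any of the three shapes, so it is not treated as a combination in this sketch.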



8. The method of claim 6, wherein allowed adjacent
combinations comprise one or more prefixes, followed by one
or more roots or words, followed by one or more suffixes;
one or more roots or words followed by one or more suffixes;
and one or more prefixes followed by one or more roots or
words.


9. The method of claim 4, further comprising
combining the particular word fragment with the one or more
adjacent word fragments or words to form a second proposed
word that differs from the first proposed word by using a
second associated spelling rule in forming the proposed
word.

10. The method of claim [illegible], wherein:
[illegible] associated with a [illegible]
one or more spelling [illegible]
word fragment, [illegible] fragment with one or [illegible]
fragment with the one [illegible]

11. The method of claim 1, wherein determining
whether the word fragment may be combined with one or more
adjacent word fragments or words to form a proposed word
included in the backup dictionary of the speech recognition
system comprises [illegible] proposed word.

12. The method of claim 1, wherein modifying the
recognition candidate comprises: 

forming a prospective recognition candidate by 
modifying the recognition candidate to substitute the 
proposed word for the word fragment and the one or more 
adjacent word fragments or words used to form the proposed 
word; and

if the prospective recognition candidate includes an 
additional word fragment: 

further processing the prospective recognition 
candidate to generate an additional word using the 
additional word fragment and one or more adjacent words or 
word fragments, and 

forming a final recognition candidate by 
replacing the additional word fragment and the one or more 
adjacent words with the additional word. 




13. The method of claim 1, wherein a score is
associated with the received recognition candidate, the 
method further comprising producing a score associated with 
the modified recognition candidate by rescoring the modified 
recognition candidate. 

14. The method of claim 13, wherein:
the score associated with the received recognition 

candidate includes an acoustic component and a language 
model component, and 

rescoring the modified recognition candidate 
comprises generating a language model score for the modified 
recognition candidate. 
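
Claims 13 and 14 describe a score with an acoustic component and a language model component, and a rescoring step that generates a new language model score. The sketch below assumes negative log probabilities, a toy unigram model, and that the acoustic component is carried over unchanged; none of these choices is stated in the claims, and the names `Candidate`, `unigram_lm_score`, and `rescore` are hypothetical.

```python
import math
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class Candidate:
    words: List[str]
    acoustic_score: float  # assumed: negative log likelihood, lower is better
    lm_score: float        # assumed: negative log probability, lower is better

    @property
    def total(self) -> float:
        return self.acoustic_score + self.lm_score


def unigram_lm_score(words: List[str], probs: Dict[str, float],
                     floor: float = 1e-6) -> float:
    """Toy language model score: sum of negative log unigram probabilities."""
    return sum(-math.log(probs.get(w, floor)) for w in words)


def rescore(original: Candidate, modified_words: List[str],
            probs: Dict[str, float]) -> Candidate:
    """Generate a language model score for the modified candidate.

    Assumption: the acoustic component of the received candidate is reused.
    """
    return Candidate(modified_words, original.acoustic_score,
                     unigram_lm_score(modified_words, probs))
```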

15. The method of claim 14, [illegible] comprising
generating an acoustic [illegible] associated with the
modified recognition candidate [illegible] the language model
[illegible] recognition candidate.

16. The method of claim 13, wherein:
the score associated with the received recognition
candidate includes an acoustic component and a language
[illegible] component, and
[illegible] comprises generating an [illegible]
recognition candidate.

[illegible]. The method of [illegible], wherein producing the
score associated with the modified recognition candidate
comprises combining [illegible] recognition candidate with the
[illegible] generated for the modified
candidate.

19. The method of claim 1, further comprising
generating the acoustic model of a word fragment by:
comparing a word of the active vocabulary to a
similar word of a backup dictionary to identify a word
fragment that may be used to convert the word of the active
vocabulary to the word of the backup dictionary, and
generating the acoustic model of the word fragment
using a portion of an acoustic model of the word of the
backup dictionary that is not included in an acoustic model
of the word of the active vocabulary.

20. The method of claim 19, wherein comparing a
word of the active vocabulary to a similar word of a backup
dictionary comprises comparing spellings of the two words.
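
The spelling comparison of claims 19 and 20 can be sketched as follows. The sketch covers only the spelling side (finding the letters that convert the active-vocabulary word into the backup-dictionary word); deriving the acoustic model of the fragment is not shown. The function name `find_affix` and the restriction to a single prefix or suffix are assumptions.

```python
from typing import Optional, Tuple


def find_affix(active_word: str, backup_word: str) -> Optional[Tuple[str, str]]:
    """Compare spellings and return the fragment that converts one word to the other.

    Returns ("suffix", text) or ("prefix", text) when backup_word is
    active_word plus extra letters at one end, otherwise None.
    """
    if len(backup_word) <= len(active_word):
        return None
    if backup_word.startswith(active_word):
        return ("suffix", backup_word[len(active_word):])
    if backup_word.endswith(active_word):
        return ("prefix", backup_word[:-len(active_word)])
    return None
```

Under these assumptions, comparing "walk" with "walking" yields the suffix "ing", and comparing "heat" with "preheat" yields the prefix "pre".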

21. The method of claim 1, further comprising
generating the acoustic models of word fragments by:
comparing words of the active vocabulary to similar
words of a backup dictionary to identify spelling rules that
may be used to convert the words of the active vocabulary to
words of the backup dictionary; and
employing the spelling rules in identifying word
fragments.

22. The method of claim 21, wherein employing the
spelling rules in identifying the word fragments comprises:
grouping spelling rules together to form possible
affixes, the affixes including prefixes and suffixes; and
analyzing words of the backup dictionary using the
affixes to identify roots that may be combined with the
affixes to produce words of the backup dictionary.

23. The method of claim 22, further comprising
generating acoustic models for the roots using portions of 
acoustic models of the words of the backup dictionary that 
are not included in acoustic models of the affixes. 

24. The method of claim 22, further comprising
adding affixes and roots to the active vocabulary as word
fragments.




25. The method of claim 24, further comprising
storing a set of spelling rules in association with an affix 
in the active vocabulary. 




26. The method of claim 24, further comprising
creating a language model associated with the active 
vocabulary. 




27. The method of claim 26, wherein creating the

language model comprises: 

retrieving a training collection of text, the 
training collection of text comprising words from the backup 
dictionary and words from the active vocabulary; 

modifying the training collection of text by 
replacing any splittable backup dictionary words with their 
corresponding words and word fragments; and 

generating language model scores for words and word 
fragments of the active vocabulary using the modified 
collection of text. 
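
The text-modification approach of claim 27 can be sketched as below. The split table, the function names, and the use of relative unigram frequency as the "language model score" are illustrative assumptions only.

```python
from collections import Counter
from typing import Dict, List

# Hypothetical split table: splittable backup-dictionary word ->
# its corresponding active-vocabulary words and word fragments.
SPLITS: Dict[str, List[str]] = {
    "walking": ["walk", "ing"],
    "preheat": ["pre", "heat"],
}


def split_text(tokens: List[str]) -> List[str]:
    """Replace each splittable backup-dictionary word with its parts."""
    out: List[str] = []
    for tok in tokens:
        out.extend(SPLITS.get(tok, [tok]))
    return out


def unigram_scores(tokens: List[str]) -> Dict[str, float]:
    """Assumed scoring: relative frequency of each word or fragment."""
    counts = Counter(tokens)
    total = sum(counts.values())
    return {tok: n / total for tok, n in counts.items()}
```

Scores for fragments such as "ing" then come directly from the modified text, alongside scores for ordinary words.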



28. The method of claim 26, wherein creating the
language model comprises creating an N-gram language model.

29. The method of claim 28, wherein creating the
N-gram language model comprises:

retrieving a training collection of text, the 
training collection of text comprising words from the backup 
dictionary and words from the active vocabulary; 

determining a frequency of each N-gram word sequence 
that appears in the training collection of text; 

modifying the N-gram word sequences by replacing any 
splittable backup dictionary words with their corresponding 
words and word fragments; 

based on the N-gram word sequence frequencies, 
determining a frequency of each modified N-gram sequence 
that includes words, word fragments, or words and word 

fragments; and 

based on the N-gram word and word fragment sequence 
frequencies, generating an N-gram language model for the 
words and word fragments of the active vocabulary. 
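
Claim 29 works from N-gram frequencies rather than from the modified text itself. One way this could be done is sketched below. The rule used to avoid double counting (crediting only those modified N-grams that end inside the expansion of the original N-gram's final word) is an assumption of this sketch and is not taken from the claims; `SPLITS` and both function names are hypothetical.

```python
from collections import Counter
from typing import Dict, List

# Hypothetical split table for splittable backup-dictionary words.
SPLITS: Dict[str, List[str]] = {"walking": ["walk", "ing"]}


def ngram_counts(tokens: List[str], n: int) -> Counter:
    """Frequency of each N-gram word sequence in the text."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def modified_ngram_counts(word_counts: Counter, n: int) -> Counter:
    """Derive word-and-fragment N-gram frequencies from word N-gram frequencies."""
    out: Counter = Counter()
    for ngram, count in word_counts.items():
        context = [t for w in ngram[:-1] for t in SPLITS.get(w, [w])]
        last = SPLITS.get(ngram[-1], [ngram[-1]])
        seq = context + last
        # Credit only N-grams ending inside the expansion of the final word,
        # so overlapping original N-grams do not count the same tokens twice.
        for end in range(len(context) + 1, len(seq) + 1):
            out[tuple(seq[end - n:end])] += count
    return out
```

With this attribution rule, the derived bigram counts for a short text match the counts obtained by splitting the text first and counting afterwards.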




30. The method of claim 1, wherein word fragments
comprise syllables, and each syllable includes a unit of
spoken language.

31. The method of claim 30, wherein the unit of
spoken language comprises a single uninterrupted sound
formed by a vowel or diphthong alone, of a syllabic
consonant alone, or of either with one or more consonants.

32. The method of claim 30, wherein a syllable
comprises a vowel and one or more consonants clustered
around the vowel.

33. The method of claim 1, wherein determining
whether the word fragment may be combined with one or more 
adjacent word fragments or words to form a proposed word 
included in the backup dictionary of the speech recognition 
system comprises searching the backup dictionary for the 
proposed word based on a pronunciation of the proposed word. 

34. A method of recognizing speech, the method
comprising:

using a speech recognizer to perform speech
recognition on a user utterance to produce a set of one or
more recognition candidates, the speech recognition
comprising comparing digital values representative of the
user utterance to a set of acoustic models representative of
an active vocabulary of the system, the set of acoustic
models including models of words, models of roots that are
not words, and models of affixes that are not words, the
affixes including prefixes and suffixes,

receiving the recognition candidates from the speech
recognizer, and

when a received recognition candidate includes an
affix:

combining the affix with one or more adjacent
words, roots, or other affixes to form a new word; and

modifying the recognition candidate to
substitute the new word for the affix and the one or more
adjacent words, roots, or other affixes used to form the new
word.






35. A method of generating an acoustic model of a
word fragment, the method comprising:
comparing a word of an active vocabulary to a
similar word of a backup dictionary to identify a word
fragment that may be used to convert the word of the active
vocabulary to the word of the backup dictionary, and
generating the acoustic model of the word fragment
using a portion of an acoustic model of the word of the
backup dictionary that is not included in an acoustic model
of the word of the active vocabulary.

36. The method of claim 35, wherein comparing a
word of the active vocabulary to a similar word of a backup
dictionary comprises comparing spellings of the two words.

37. A method of generating acoustic models of word
fragments, the method comprising:
comparing words of an active vocabulary to similar
words of a backup dictionary to identify spelling rules that
may be used to convert the words of the active vocabulary to
words of the backup dictionary; and
employing the spelling rules in identifying word
fragments.

38. The method of claim 37, wherein employing the
spelling rules in identifying the word fragments comprises:
grouping spelling rules together to form possible
affixes, the affixes including prefixes and suffixes; and
analyzing words of the backup dictionary using the
affixes to identify roots that may be combined with the
affixes to produce words of the backup dictionary.

39. The method of claim 38, further comprising
generating acoustic models for the roots using portions of
acoustic models of the words of the backup dictionary that
are not included in acoustic models of the affixes.



40. The method of claim 38, further comprising
adding affixes and roots to the active vocabulary as word
fragments.



41. The method of claim 40, further comprising
storing a set of spelling rules in association with an affix
in the active vocabulary.

42. The method of claim 40, further comprising
creating a language model associated with the active
vocabulary.



43. The method of claim 42, wherein creating the
language model comprises:

retrieving a training collection of text, the
training collection of text comprising words from the backup
dictionary and words from the active vocabulary;

modifying the training collection of text by
replacing any splittable backup dictionary words with their
corresponding words and word fragments; and

generating language model scores for words and word
fragments of the active vocabulary using the modified
collection of text.

44. The method of claim 42, wherein creating the
language model comprises creating an N-gram language model.

45. The method of claim 44, wherein creating the
N-gram language model comprises:

retrieving a training collection of text, the 
training collection of text comprising words from the backup 
dictionary and words from the active vocabulary; 

determining a frequency of each N-gram word sequence 
that appears in the training collection of text; 

modifying the N-gram word sequences by replacing any 
splittable backup dictionary words with their corresponding 
words and word fragments; 

based on the N-gram word sequence frequencies, 
determining a frequency of each modified N-gram sequence 
that includes words, word fragments, or words and word 

fragments; and 

based on the N-gram word and word fragment sequence 
frequencies, generating an N-gram language model for the 
words and word fragments of the active vocabulary. 

46. A computer-implemented speech recognition
system that has an expanded effective active vocabulary,
the system comprising:

a storage device configured to store an active
vocabulary that includes multiple entries corresponding to
words, commands, and word fragments; and

a processor configured to:

receive data representing a user utterance,

produce one or more recognition candidates, by
comparing digital values representative of the user
utterance to a set of acoustic models representative of the
active vocabulary of the system, the set of acoustic models
including models of words and models of word fragments,

when a recognition candidate includes
a word fragment:

if the word fragment may be combined with
one or more adjacent word fragments or words to form a
proposed word included in a backup dictionary, modify the
recognition candidate to substitute the proposed word for
the word fragment and the one or more adjacent word
fragments or words used to form the proposed word; and

if the word fragment may not be combined
with one or more adjacent word fragments or words to form a
proposed word included in a backup dictionary of the speech
recognition system, discard the recognition candidate.

47. The system of claim 46, wherein the expanded
effective active vocabulary includes words from the backup
dictionary that are formed from a combination of words and
word fragments or word fragments and word fragments from an
active vocabulary, word fragments, and words from the active
vocabulary.

48. The system of claim 46, wherein the processor
determines whether the word fragment may be combined with
one or more adjacent word fragments or words to form a
proposed word included in the backup dictionary by searching
the backup dictionary for the proposed word.

49. The system of claim 46, wherein the processor
modifies the recognition candidate by:
forming a prospective recognition candidate by
modifying the recognition candidate to substitute the
proposed word for the word fragment and the one or more
adjacent word fragments or words used to form the proposed
word; and
if the prospective recognition candidate includes an
additional word fragment:
further processing the prospective recognition
candidate to generate an additional word using the
additional word fragment and one or more adjacent words or
word fragments, and
forming a final recognition candidate by
replacing the additional word fragment and the one or more
adjacent words with the additional word.

50. The system of claim 46, wherein a score is
associated with the received recognition candidate, and the
processor is further configured to produce a score
associated with the modified recognition candidate by
rescoring the modified recognition candidate.

52. The computer software of claim 51, wherein the
expanded effective active vocabulary includes words from
the backup dictionary that are formed from a combination of
words and word fragments or word fragments and word
fragments from an active vocabulary, word fragments, and
words from the active vocabulary.

53. The computer software of claim 51, wherein
determining whether the word fragment may be combined with
one or more adjacent word fragments or words to form a
proposed word included in the backup dictionary comprises
searching the backup dictionary for the proposed word.

54. The computer software of claim 51, wherein
modifying the recognition candidate comprises:
forming a prospective recognition candidate by
modifying the recognition candidate to substitute the
proposed word for the word fragment and the one or more
adjacent word fragments or words used to form the proposed
word; and
if the prospective recognition candidate includes an
additional word fragment:
further processing the prospective recognition
candidate to generate an additional word using the
additional word fragment and one or more adjacent words or
word fragments, and
forming a final recognition candidate by
replacing the additional word fragment and the one or more
adjacent words with the additional word.

55. The computer software of claim 51, wherein a
score is associated with the received recognition candidate,
the computer software comprising instructions for causing
the computer to produce a score associated with the modified
recognition candidate by rescoring the modified recognition
candidate.